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Abstract 



The Lovasz Local Lemma jEL75j is a powerful tool to non-constructively prove the existence of 
combinatorial objects meeting a prescribed collection of criteria. In his breakthrough paper [Bec91| . 
Beck demonstrated that a constructive variant can be given under certain more restrictive conditions. 
Simplifications of his procedure and relaxations of its restrictions were subsequently exhibited in several 
publications [AloQli IMHSSl ICSOOi IMos06> ,Sri08l FMosOSj . In |Mos09| . a constructive proof was presented 
that works under negligible restrictions, formulated in terms of the Bounded Occurrence Satisfiability 
problem. In the present paper, we reformulate and improve upon these findings so as to directly apply 
to almost all known applications of the general Local Lemma. 
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1 Introduction 

Let yi be a finite collection of mutually independent events in a probability space. The probability that none 
of these events happen is exactly nAeyi(-'- ~ Prl^])- In particular, this probability is positive whenever no 
event in A has probability 1. Laszlo Lovasz's famous Local Lemma [EL75j allows for limited dependence 
among the events, but still concludes that with positive probability none of the events happen if the 
individual events have bounded probability. Here is the lemma in a very general form. 

Theorem 1.1. |EL75| Let A be a finite set of events in a probability space. For A & A let T{A) be a 
subset of A satisfying that A is independent from the collection of events A \ {{A} U T{A)). If there exists 
an assignment of reals x : A ^ (0,1) such that 



yA£A : Pt[A] < x{A) Yl C^-xiB)), 



BGr(A) 



then the probability of avoiding all events in A is at least YlAeAi^ ~ ^i^))' particular it is positive. 
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The original proof of this statement is non-constructive and does not yield an efficient procedure for 
searching the probability space for a point with the desired property. The purpose of the present paper 
is to give an alternative, algorithmic proof that provides such a procedure. This is not the first attempt 
to do so. In |Bec91j . Beck achieved a significant breakthrough demonstrating that algorithmic versions 
of the Local Lemma exist. He formulated his strategy in terms of hypergraph 2-coloring as a specific 
application of the lemma and proved that if in a hypergraph, every edge contains at least k vertices and 
shares common vertices with no more than roughly 2^/^^ other edges, then a polynomial time algorithm 
can 2-color the vertices without producing a monochromatic edge. The existential version of the Local 
Lemma on the other hand allows for every edge to share vertices with roughly 2^ /e other edges and 
guarantees the existence of such a coloring. Subsequently, several authors have attempted to improve upon 
the gap between the existential version and its constructive counterparts. Alon improved the threshold 
to essentially 2^^^ using a simpler and randomized variant of Beck's algorithm |Alo91j . Molloy and Reed 
provided in |MR98j a general framework capturing the requirements a particular application has to meet 
so as to become tractable by the tools of Beck and Alon. A small error in this result was recently fixed in 
|PT09j . Czumaj and Scheideler's contribution in [CSOOj extended known algorithmic versions to somewhat 
more general cases where the edges of the hypergraph in question need not be of uniform size. Srinivasan 
in |Sri08j provided another improvement that reduced the gap to a threshold of essentially 2'^/^ along with 
a series of other advantages over the previous approaches. In [MosOSj . yet another variant was presented 
that achieves a polynomial running time for instances up to a neighborhood size of roughly 2*^/^ and finally 
in |Mos09j . the threshold was lowered to roughly 2^/32. In the present paper, we reformulate and improve 
upon the last cited result both so as to get rid of the now unnecessary constant in the hypothesis and so 
as to directly apply to almost all applications of the Local Lemma known so far. The only restriction we 
have to impose upon the general setting in the non-constructive version as formulated above will be that 
we consider events determined by different subsets of underlying mutually independent random variables 
and T{A) consists of all events that depend on some of the same variables as A. See the exact formulation 
below. While this appears to be necessary in order to get any algorithmic access to the problem, it seems 
as well to be the case in almost all known applications. 

Let y be a finite collection of mutually independent random variables in a fixed probability space 0. 
We will consider events A that are determined by the values of some subset C J" of these variables. In 
such a case we say that an evaluation of the variables in S violates A if it makes A happen. Clearly, if A 
is determined by T, then there is a unique minimal subset S C J" that determines A. We denote this set of 
variables by vbl(^) and assume throughout the paper that this set is given to all algorithms dealing with 
the event A. 

Let yi be a finite family of events in Q determined by 7. We define the dependency graph G = Gji for A 
to be the graph on vertex set A with an edge between events A,B ^ A\i A^ B but vbl(A)nvbl(i?) ^ 0. For 
A G A we write V{A) = Tji{A) for the neighborhood of A in G. Note that r(^) satisfies the requirement 
in Theorem 11.11 as A is determined by the variables in vbl(j4) and the events in yi \ {{A} U T{A)) are 
determined by the rest of the variables in 7. 

Given the family A of events as above our goal is not only to show that there exists an evaluation 
that does not violate any event in the family but to efficiently find such an evaluation. The algorithm 
we suggest (Algorithm 1.1) is as simple and natural as it can get: We start with a random point in O 
and maintain an evaluation vp of each variable P G J". We check whether some event in A is violated. If 
so, we arbitrarily pick a violated event A ^ A and sample another random assignment of values for the 
variables in vbl(A) on which A depends, each one independently and according to its distribution while not 
changing the values of the variables in J \ vbl(A). We call this a resampling of the event A. We continue 
resampling violated events until no such event exists anymore. We will prove that this simple algorithm 
quickly terminates, i.e., it quickly reaches an evaluation of the variables not violating any of the events in 
A if the conditions of the Local Lemma are satisfied. 
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function sequentiaLlll(J',7l) 
for all P e y do 

vp <^ a random evaluation of P; 
while 3A e A : A is violated when (P = wp : VP £ T) do 

pick an arbitrary violated event A G A; 

for all P G vbl(yl) do 

Up ^ a new random evaluation of P; 
return {vp)pe9; 



Algorithm 1.1: the sequential solver 

The efficiency of the method clearly depends upon whether random values for each variable can be 
efficiently sampled and whether they can be efficiently checked against the given events. This is the case 
for almost all known applications of the lemma and it is less restrictive than previously known methods 
which required conditional probabilities or expectations to be computed. We will analyze the efficiency of 
the algorithm in terms of the expected number of times an event ^ G ^1 is resampled. 

Theorem 1.2. Let y he a finite set of mutually independent random variables in a probability space. Let A 
be a finite set of events determined by these variables. If there exists an assignment of reals x : A ^ (0,1) 
such that 

yAeA : Fv[A] < x{A) Yl (l-^(-B)), 

then there exists an assignment of values to the variables 7 not violating any of the events in A. Moreover 
the randomized algorithm described above resamples an event A ^ A at most an expected x{A)/{l — x{A)) 
times before it finds such an evaluation. Thus the expected total number of resampling steps is at most 

l^AeA l-x{A) ■ 

Our algorithm lends itself for parallelization. In the parallel version of the algorithm (Algorithm 1.2) 
we start again with the evaluation of the variables at a random point in the probability space, then in 
every step we select a maximal independent set S in the subgraph of the dependency graph G spanned 
by the violated events and resample all the variables these events depend on in parallel. That is, we take 
independent new samples of the variables in U^^^vb^jd) and keep the values assigned to the rest of the 
variables. We continue until we find an evaluation not violating any of the events. This algorithm can 
be considered a special case of the sequential algorithm, so the statement of Theorem 11.21 applies to the 
parallel version too. In order to give a logarithmic bound for the expected number of steps we assume 
slightly stronger bounds on the probabilities of the events. 



function paralleLlll(!P,yi) 

for all P G T do in parallel 

vp ^ a random evaluation of P; 
while 3A e A : A is violated when (P = tip : VP £ T) do 

<— a maximal independent set in the subgraph of Ga induced by all events 
which are violated when (P = vp : VP e J*), constructed in parallel; 

for all P e [J vbl(A) do in parallel 

Aes 

Vp a new random evaluation of P; 
return {vp)p^y; 



Algorithm 1.2: the parallel solver 
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Theorem 1.3. Let 7 be a finite set of mutually independent random variables in a probability space. Let 
A be a finite set of events determined by these variables. If e > and there exists an assignment of reals 
X : A ^ (0,1) such that 

yAeA : Fi[A] < {l-e)x{A) {l-x{B)), 

then the parallel version of our algorithm takes an expected log ^^gy^ i~L^a) ) '''t^P^ before it finds an 
evaluation violating no event in A. 

There is not much about our algorithm that is inherently randomized. We can demonstrate that under 
additional conditions, most notably a constant bound on the maximum degree of the dependency graph, 
the same task can be performed by a deterministic procedure. 

Theorem 1.4. Let 7 = {Pi, P2, . . . , Pn} be a finite set of mutually independent random variables in a 
probability space, each Pi taking values from a finite domain Di. Let A be a set of m events determined by 
these variables. Consider the problem size to be s := Jn- + ?^ + Suppose there exists an algorithm 

that can compute, for each A A and each partial evaluation [vi G Di)nzi, I C [n] the conditional 
probability Pr[A\Mi G / : Pj = Wj] in time polynomial in s. Suppose, moreover, that the maximum degree of 
the dependency graph Gji is bounded by a constant, that is yA G A : |ryi^(A)| < k for some constant k. If 
there is a constant e > and an assignment of reals x : A (0,1) such that 

yAeA : Pt[A] < {l-e)x{A) JJ {l-x{B)), 

BeVji {A) 

then a deterministic algorithm can find an evaluation of the variables such that no event occurs in time 
polynomial in s. 

In Sections 2 and 3 we provide an analysis of the algorithm, proving Theorem 11.21 in Section 4 
we analyze the parallel version and prove Theorem 11.31 In Section 5, we prove the correctness of the 
derandomized algorithm as claimed in Theorem II. 4i Section 6 contains a lopsided version of Theorem 11.21 
and Section 7 has concluding remarks. 

2 Execution logs and witness trees 

Note that the decision which violated event A £ A to correct in each step can be taken completely 
arbitrarily. Let us fix any (deterministic or randomized) procedure for this selection, this makes the 
algorithm and the expected values we consider well defined. The selection method the algorithm uses does 
not matter for our analysis. 

We need to record an accurate journal of what the algorithm does. Let C : N ^ ^1 list the events as 
they have been selected for resampling in each step. If the algorithm terminates, C is partial and defined 
only up to the given total number of steps carried out. We call C the log of the execution. With a fixed 
selection discipline as described above, C is now a random variable determined by the random choices the 
algorithm makes. 

Recall that dependency graph G is a graph on vertex set A where two distinct events A, A' £ A are 
connected if vbl(^) PI vbl(^') 7^ and that T{A) denotes the neighborhood of the vertex A in G. We also 
use here the inclusive neighborhood T~^{A) := T{A) U {A} of a vertex A. 

A witness tree r = {T^ax) is a finite rooted tree T together with a labelling : V{T) ^ yi of its 
vertices with events such that the children of a vertex u G y{T) receive labels from T^{aT{u)). If distinct 
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children of the same vertex always receive distinct labels we call the witness tree proper. To shorten 
notation, we will write V{t) := V{T) and for any v G ^(t), we write [v] := (Jt{v). Given the log C, we will 
now associate with each resampling step t carried out a witness tree Tc{t) that can serve as a 'justification' 
for the necessity of that correction step. Let us define T^\t) to be an isolated root vertex labelled C{t). 
Then going backwards through the log, for each z = t — — 2,...,1, we distinguish two cases. If there is 
a vertex v € T^,^^\t) such that C{i) € r"'"([f]), then we choose among all such vertices the one having the 
maximum distance from the root and attach a new child vertex u to f that we label C{i), thereby obtaining 
the tree T^{t). In the selection of the maximum distance vertex we break ties arbitrarily. If there is no 
vertex v € T'^'^^\t) such that C{i) G r"''([v]), then we skip time step i and simply define T^*^(t) := T^^'^\t). 
Finally let Tc{t) := r^^^(t). 

We say that the witness tree r occurs in the log C if there exists t £ N such that Tc{t) = t. 

Lemma 2.1. Let r be a fixed witness tree and C the (random) log produced by the algorithm. 

(i) If T occurs in C , then t is proper. 

(ii) The probability that t appears in C is at most JlDev(r) -P^il"^]]- 

Proof. Assume r occurs in the log C, so we have Tc{t) = t for some t G N. For a vertex v € V{t) let 
d{v) denote the depth of vertex v, that is its distance from the root and let q{v) stand for the step of the 
algorithm constructing Tc{t) in which v was attached, that is, q(v) is the largest value q with v contained 
inr(^)(t). 

First note that if q{u) < q{v) for vertices u,v £ V{t) and vbl([ti]) and vbl([u]) are not disjoint, then 
d{u) > d{v). Indeed, when adding the vertex u to TQ^^^^^\t) we attach it to v or to another vertex of 
equal or greater depth. As a consequence observe that for any two vertices u,v € V{t) at the same depth 
d{v) = d{u), [u] and [v] do not depend on any common variables, that is the labels in every level of r form 
an independent set in G. In particular r must be proper, establishing claim (i). 

Consider the following procedure that we call r-check: In an order of decreasing depth (e.g., reversed 
breadth first search order) visit the vertices of r and for a vertex v take a random evaluation of the variables 
in vbl([f]) (according to their distribution, independent of possible earlier evaluations) and check if the 
resulting evaluation violates [v]. We say that the r-check passes if all events were violated when checked. 

Trivially, the r-check passes with probability exactly nugy(r) -'^^[t^]]- '^^^ lemma follows from the 
observation that whenever r occurs in the log and we run the r-check on the same random source it passes. 
For this coupling argument we have to specify explicitly how the algorithms use the random source. We 
assume that for each variable P € T the random source produces an infinite list of independent random 
samples P^^\P^^\ . . ., and whenever either algorithm calls for a new random sample of P we pick the next 
unused value from this sequence. 

Having assumed that r = Tc{t), we need to prove that the r-check passes, that is, when it considers a 
vertex v G V{t) and takes random samples of the variables in vbl([f]), the resulting evaluation violates [v]. 
Let us fix the vertex v and for P € vbl([f]) let S{P) be the set of vertices w G V{t) with d{w) > d{v) and 
P € vbl([t(;]). When the r-check considers the vertex v and samples P the random source gives p(l'^(^)l). 
This is because the r-check visits the vertices in order of decreasing depth and among the vertices with 
depth equal to d{v) only the label of v depends on P, so before the r-check considers v it had sampled P 
exactly when it was considering the vertices in S{P). 

In step q{v), our algorithm chooses the event [v\ for resampling, so [v\ must be violated before this 
resampling. We claim that for P G vbl([u]) the current value of the variable P is at this time. 

Indeed, P was sampled at the beginning of the algorithm and then at the steps q{w) < q{v) for w G S{P). 
As the r-check has these exact same values for the variables in vbl([v]) when considering v it also must 
find that [v\ is violated, proving (ii). □ 
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For any event A A, let us denote by Na the random variable that counts how many times the event 
A is resampled during the execution of our algorithm. If C is the log of the execution of our algorithm, 
then Na is the number of occurrences of A in this log and also the number of distinct proper witness trees 
occurring in C that have their root labeled A. The latter statement holds because if ti is the i-th time step 
with C{ti) = A, then obviously the tree TciU) contains exactly i vertices labelled A, thus TciU) 7^ Tc{tj) 
unless i = j. Therefore one can bound the expectation of Na simply by summing the bounds in Lemma [2. II 
on the probabilities of the occurrences of the different proper witness trees. In the next section we do just 
that by relating these probabilities to a random process. 



3 Random generation of witness trees 

Let us fix an event A A and consider the following multitype Galton- Watson branching process for 
generating a proper witness tree having its root labelled A. In the first round, we produce a singleton 
vertex labelled A. Then in each subsequent round, we consider each vertex v produced in the previous 
round independently and, again independently, for each event B € r"'"([f]) identical or adjacent to [v] in 
the dependency graph, we add to u a child node carrying the label B with probability x{B) or skip that 
label with probability 1 — x{B). All these choices are independent. The process continues until it dies out 
naturally because no new vertices are born in some round (depending on the probabilities used, there is, 
of course, the possibility that this never happens). 

Let x'{B) := x{B) Y\cer{B)(^ ~ ^(^))- probability that the described Galton- Watson process 

yields a prescribed proper witness tree we obtain the following. 

Lemma 3.1. Let r a fixed proper witness tree with its root vertex labelled A. The probability pr that the 
Galton- Watson process described above yields exactly the tree r is 

p^='-^ n -'(H)- 

^ ^ veV{T) 

Proof. For a vertex v G V{t) we denote by ^ r^([i;]) the set of inclusive neighbors of [v] that do 
not occur as a label of some child node of v. Then clearly, the probability that the Galton- Watson process 
produces exactly r is given by 



= 7^4) n f^(N) n(i-^(M))), 



where the leading factor accounts for the fact the the root is always born. In order to get rid of the W^, 
we can rewrite this expression in an obvious way to obtain 

1 - TT / ^(M) TT n l^^ 1 

^ ' v&v(t) \ ^' «6r+{H) / 

where again we have to account for the root separately. Replacing inclusive by exclusive neighborhoods, 
this simplifies to 

f' = ^w^ n f-(M) n (i--(M)))=i^ n 

□ 
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Let 7a denote the set of all proper witness trees having the root labelled A. We have 



E{Na) = Pr[r appears in the log < J] IT ^^t^] - 11 ^'(N)> 

where the first inequality follows from Lemma |2. 11 while the second follows from the assumption in Theo- 
rem [L2j We further have 

where the equality comes from Lemma [3. 1|, while the last inequality follows from the fact that the Galton- 
Watson process produces exactly one tree at a time (not necessarily one from T a since it might also grow 
infinite). This concludes the proof of Theorem 11.21 □ 



4 Analyzing the parallel algorithm 

Let us consider an arbitrary execution of the parallel version of the algorithm. We choose an arbitrary 
ordering of the violated events selected for resampling at each step and consider that these resamplings are 
done in that order sequentially. This way we obtain an execution of the sequential algorithm. Let Sj be 
the segment of the log C of this execution that corresponds to resamplings done in step j of the parallel 
algorithm. We call the maximal depth of a vertex in a witness tree the depth of the tree. 

Lemma 4.1. Ift&Sj, then the depth of Tc{t) is j — 1. 

Proof. Let t^ be the first number in the segment Sk and let = T^''\t) for k < j. As the events 
resampled in the j-th parallel step are independent, the root is the only vertex of Tj. For k < j we obtain 
Tfc from Tk+i by attaching some vertices corresponding to the A:-th parallel step of the algorithm. As these 
vertices have independent labels they can only add one to the depth. To see that they do add one to the 
depth consider a vertex v of t^+i of maximal depth. This vertex corresponds to a resampling of the event 
[v] some time after step k of the parallel algorithm. If has no vertex with higher depth than v, then 
from the parallel step k to the resampling corresponding to v no event from r"'"([u]) was resampled. But 
this implies that [v] was already violated at parallel step k and we did not select a maximal independent 
set of violated events there for resampling. The contradiction shows that the depth of is indeed one 
more than that of t^+i- To finish the proof notice that Tc{t) = ti. □ 

Let Q{k) denote the probability, that the parallel algorithm makes at least k steps. By Lemma 14.11 
some witness tree of depth k — 1 must occur in the log in this case. Notice that a depth k — 1 witness tree 
has at least k vertices. Let 7A{k) be the set of witness trees in 7a having at least k vertices. We have 

Qik)<Yl E Pr[r appears in the log C] < J] P^N] < E E 11 ^'(N)' 

AeArelAik) A£ATe7A(k)veV{T) AeAreJAveVir) 

where the last inequality follows from the assumption in Theorem 1 1.3i Then, as before, we have 

AeArelAveViT) AeA ^ ' T&7A(k) AeA ^ ' 

This bound easily implies Theorem 11.31 □ 
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5 A deterministic variant 



It is possible to derandomize our algorithm under the additional assumptions listed in Theorem 11.41 The 
idea is to create a list of all large potential witness trees which, if done with care, remains polynomial 
in length, and then to search for a table of evaluations of all random variables which, substituted for the 
random source in the original randomized variant, guarantees that the running time will be low. 

To begin with, let us note that we can assume the weights x{A) for A € ^1 to be bounded away from 
1. If they are not, we can simply replace x{A) by, say, x{A) := (1 — €/2)x{A) for all A and e by e/2. It 
is easily checked that all requirements are still satisfied and now the weights x{A) are upper bounded by 

(1-6/2). 

Let (up'^ G A) i<i<n,jGN be sequences of values for the variables. Suppose we replace the random 
source by these sequences in such a way that we take vf as the j-th. sample and now run either the 
parallel or the sequential algorithm as detailed in the previous sections. Recalling the proof of Lemma l2.lt 
let us say that a witness tree r is consistent with the sequences (up^ G Di)i<i<n,jeN if the r-check passes 
when substituting them for the random source. We have proved that if the values vf are selected at 
random, then the expected number of witness trees which are consistent with them and which feature at 
least k vertices is bounded by (1 — e)'^ '^aga i-L"(a) weights are bounded away from 1, this is 

in turn bounded by 0(m(l — e)'^). There exists therefore a constant c such that the expected number of 
witness trees of size at least clogm that are consistent with the sequences is at most 1/2. With probability 
at least 1/2, no consistent witness tree of size at least clogm exists at all. Since no variable can then be 
reassigned more than clogm times, we can restrict ourselves to finding values G -Di)i<i<n,o<j<ciogm 
such that no large tree becomes consistent. Such values can be found sequentially using the method of 
conditional expectations. First of all, we have to make sure that the number of witness trees we have to 
consider is not too large, which is demonstrated by the following lemma. 

Lemma 5.1. Suppose the maximum degree of the dependency graph is bounded by k as required in the 
theorem. Let u G N. // there exists a witness tree of size at least u that is consistent with given evaluation 
sequences {v^''^)Ki<n,jeN, then there also exists a witness tree of some size in the range [u, {k + l)u\ that 
is also consistent with the sequences. 

Proof. To arrive at a contradiction, assume that the claim is wrong for some value u. Then let r be a 
smallest counterexample, that is a consistent witness tree of size larger than {k + l)u such that there exists 
no consistent witness tree of a size in the range [u, {k + l)u]. Due to the bounded maximum degrees of 
the dependency graph, each node in r has at most k + 1 children. Let wi,W2,. ■ ■ ,Wj with j < k + 1 he 
the immediate children of the root. Now we can build j distinct witness trees that are all consistent with 
the sequences as follows: traverse r in the usual level-by-level fashion bottom-up, starting at the lowest 
level and ending at the root. Consider this sequence of corrections to be a log C of length |t| and now 
construct for the j penultimate correction steps in C' (which obviously correspond to wi through wj) the 
corresponding witness trees Tj := tc/(|t| — i) for i = 1,2, ... ,j. For obvious reasons, each Tj is consistent 
with the sequences. Moreover, since Tj must contain at least as many vertices as the subtree of r rooted 
at Wi, there must be at least one i such that Tj has at least (|t| — l)/{k + 1) vertices. Since r has at 
least {k + l)u + 1 vertices, Tj has at least u of them, implying that Tj either contradicts the assumption or 
constitutes a smaller counterexample. □ 

A deterministic algorithm can now proceed as follows. Enumerate all witness trees that have a size 

in the range [clogm, {k + l)clogm] in a list L. There is clearly a polynomial number of those given that 

the dependency graph has bounded degrees. Now, in any ordering, go through all index pairs {i,j) with 

(i) 

l<i<n,0<j< clogm, computing suitable values for incrementally. For each of them, consider 

(i) 

each possible value of Di that we could assign to and then compute the expected number of trees in L 



8 



that become consistent with the sequences of values given that all values chosen so far and vf are fixed 
and the yet uncomputed values are considered to be random. In the beginning when no value is fixed so 
far, that expected value is at most 1/2. Clearly, in each step we can preserve that it remains at most 1/2 
by selecting a value for v\^^ that minimizes that expectation. Once all values are fixed, the expectation 
has to coincide with the actual number of consistent witness trees in L and therefore that number has to 
be zero. 

Computing the probability of each tree t L given that some evaluations are random and others are 
fixed can be done easily by traversing r in the usual bottom-up level-by-level fashion (note that each time 
we encounter a variable during this traversal we increment a counter pointing to the sample that has to 
be used as the current value for this variable), for each vertex computing the conditional probability that 
the corresponding event is violated given the values for the samples fixed so far (using the algorithm we 
assume to exist in the hypothesis) and multiplying those probabilities. 

After this polynomial preprocessing step, we run the usual algorithm substituting our tailored values 
for the random source having guaranteed that it will terminate after a polynomial number of steps. □ 

6 The Lopsided Local Lemma 

In this section we study a lopsided version of the Local Lemma that is slightly outside the framework of 
Theorem 11.11 Using lopsided dependence, which is an adaptation of the notion in |ES91j to our setting, 
we formulate our main result in a slightly more general form. We started with the original formulation of 
Theorem 11.21 because we find it more intuitive and it is general enough for most applications of the Local 
Lemma. This lopsided generalization can also be applied to the derandomized variant of Theorem ll.4[ On 
the other hand, we could not find an effective parallelization. 

We are still in the setting of Theorem II. 2t J" is a finite set of mutually independent random variables 
in a probability space and yi is a finite set of events determined by these variables. We say that two events 
A,B(^A are lopsidependent if there exist two evaluations / and g of the variables in T that differ only on 
variables in vbl(^) H vbl(-B) such that / violates A and g violates B but either / does not violate B or 
g does not violate A. The lopsidependency graph is the graph on the vertex set A, where lopsidependent 
events are connected by an edge. We write T'{A) = T'j^{A) for the neighborhood of an event A in this 
graph. 

Clearly, if vb^A) is disjoint from vbl(-B), then A and B cannot be lopsidependent, so we have T'{A) C 
T{A). Substituting T'{A) for T{A) in the statement of Theorem 11.21 makes the assumption weaker and 
therefore the theorem itself stronger. 

We call an event A A elementary, if there is a single evaluation of the variables in vbl(74) violating 
A. Lopsidependence between elementary events is more intuitive: the elementary events A and B are 
lopsidependent if they are disjoint (mutually exclusive). Elementary events form a very special case of 
events considered in the Local Lemma, but in case the variables in vbl(^) have finite domain, any event 
A is the union of finitely many elementary events determined by the same set of variables. Avoiding A is 
equivalent to avoiding all of these elementary events and such a "breaking up" of event A is not too costly 
in the following sense. If the assignment x : A ^ (0,1) satisfies the (slightly stronger than usual) condition 

VAg/I : Pr[^] < x{A) JJ (1 - x(5)), 

Ber+(A) 

then we can break up all the events A A into elementary events and find a corresponding assignment 
to the elementary events satisfying the same condition. Even in cases where a suitable assignment is not 
possible for the set A, breaking up the events may cause many of the dependencies among the elementary 
events to disappear when considering lopsidependence, and Theorem 16.11 may become applicable. 
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An important application where using lopsidependence has proven to be effective is the satisfiability 
problem. In a CNF formula, two clauses A and B are said to have a conflict if one contains a literal I and 
the other one its complement I. When conducting the random experiment of sampling a truth assignment 
to each variable independently, the (elementary) events that A is violated or that B is violated, respectively, 
are lopsidependent on one another. If, on the other hand, A and B merely overlap in some literal, then 
correcting either of them in the case of being violated will not harm the other one. Hence, being in conflict 
is the more relevant notion to analyze dependencies in a SAT problem. The lopsided Local Lemma has 
been effectively used, e.g. by Berman, Karpinski and Scott in |BKS03j . to prove better bounds on the 
number of occurrences per variable that can be allowed while guaranteeing a satisfying assignment. Proofs 
of this type can be made constructive using the method we present here. 

Theorem 6.1. Let 7 be a finite set of mutually independent random variables in a probability space. Let A 
be a finite set of events determined by these variables. If there exists an assignment of reals x : ^ ^ (0, 1) 
such that 

yAeA : Ft[A] < x{A) {l-x{B)), 

then there exists an assignment of values to the variables 7 not violating any of the events in A. Moreover 
our randomized algorithm resamples an event A ^ A at most an expected x{A)/{\ — x{A)) times before it 
finds such an evaluation. Thus the expected total number of resampling steps is at most YIa^a i~x^a) • 

As the proof of this theorem is almost identical to that of Theorem 11.21 we concentrate on the few 
differences. We define r'+(yl) = T')^{A) = T'j^{A) U {A} for A e A. Given a log C of an execution of 
Algorithm 1.1 and a time index t, we define the lopsided witness tree Qc{t) similarly to the usual witness 
tree Tc{t), but now the children of a vertex labelled A will be labelled from T'~^{A). Formally we start 
with the root labeled C{t) and going through the time steps i = t — l,t — 2, ...,lof the log we add a new 
vertex labelled C{i) every time C{i) € T'^{A) for a label ^ of a vertex of the tree constructed so far. If 
such a vertex exists we choose one as far from the root as possible and attach the new vertex of label C{i) 
to this existing vertex. 

We denote the label of a vertex v by [v] and call a witness tree a proper lopsided witness tree if the 
children of any vertex v receive distinct labels from r'"'"([f]). Clearly, gc{t) is a proper lopsided witness 
tree, but the equivalent of part (ii) of Lemma 12.11 is less trivial. We state and prove it below. One also 
has to modify the Galton- Watson branching process considered to produce a proper lopsided witness tree. 
With these modifications the original proof of Theorem [L2] carries over to the lopsided version. 

Lemma 6.2. Letr be a fixed proper lopsided witness tree andC the (random) log produced by the algorithm. 
The probability that r = Qc{t) holds for some t is at most niiey(T) ^^IM]- 

Proof. We prove this statement with the same coupling argument as in Lemma l2.ll if r = Qc{t), then 
the r-check (as defined in the proof of Lemma 12. ip passes if both the algorithm and the r-check use the 
same list of independent random samples P^'^^ , P^^^ , . . . for the variables P € J". Recall that the r-check 
considers the vertices of r in some fixed order vi,...,Vs of non-increasing distance from the root and, 
when considering Vi draws new samples for the variables in vbl([?;j]) and checks if they violate [vi]. Clearly, 
the probability that the r-check passes (meaning that each event is violated when checked) is exactly 

Let us fix the sequences P^^^ , P^^^ , . . . of random samples of the variables P G The log of our 
algorithm is still not determined because of the arbitrary choices we make in selecting the violated event 
to resample. If a possible log C satisfies Qc{t) = r for some t we denote by qc{v) the time step the vertex 
V was added during the construction of Qcit). In particular we have qcii") = t for the root r of r and we 
have C{qciv)) = [v]. If there are several possible logs C with r = Qcit) for some t, then we choose the 
one that minimizes w{C) := X]i=i(s + 1 — i)qc{vi)- We claim that with this choice we have qc{vi) = i for 
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i = 1, . . . ,s. Assuming the claim it is clear that the r-check passes since when it considers Vi it will draw 
the same random values for the variables in vbl([ui]) that makes the selection of C{i) = [vi] a valid choice 
for resampling in the ith step of our algorithm. 

To prove the claim above notice first that the definition of lopsidependence ensures that if C is a 
possible log of our algorithm and for some j the events C(j) and C{j + 1) are not lopsidependent, then 
the log C obtained from C by reversing the order of these two resamplings is still possible. Furthermore if 
r can be obtained as Qc{t) for some t then r can also be obtained as Qc'{t) for some t. Now assume that 
the claim does not hold. In this case there must exist indices i and j with qc{vi) = j + 1 and qcivy) 7^ j 
for any i' < i. But this means that during the construction of r = gc{t) when considering time step j we 
did not attach a new vertex labeled C(j), or if we did, it went on the level of Vi or even higher. In either 
case C(j) must not be lopsidependent from C{j + 1) = [vi], neither can C{j) and C{j + 1) coincide. Thus 
the log C' obtained from C by switching C{j) and C{j + 1) is also a possible log producing r as a lopsided 
witness tree, and we also have w{C') < w{C), a contradiction. □ 

7 Conclusion 

The bound x{A)/{l — x{A)) in Theorem 11.21 is tight, but it is only achieved in the rather uninteresting 
case, when A is an isolated vertex of the dependency graph and we set x{A) = Fr[A]. Consequently the 
bound on the total number of resampling steps is achieved only if all the events in A are independent. To 
see that the expected number of resamplings of an event A A cannot achieve the bound x{A) /(I — x{A)) 
unless A is isolated in the dependency graph notice that for equality any proper witness tree with the root 
labelled A must be present in the log with positive probability, but only those can be present where the 
labels on each level form an independent set in the dependency graph. 

To implement the sequential version of our algorithm we need only to assume that we have an algorithm 
sampling the variables in and another that finds the set of violated events for a given evaluation of the 
variables. For the parallel version we also need to be able to find a maximal independent set of violated 
vertices of the dependence graph. Luby's randomized algorithm finds a maximal independent set in any 
graph in logarithmic expected time using a processor associated with each vertex of the graph. As we 
have to find an expected logarithmic number of maximal independent sets, one after the other, the total 
running time of this part of the algorithm is O(log^m), where m = \V{G)\ = \A\. Sampling and finding 
the violated indices is typically faster, so this might be the bottleneck of our algorithm. One can reduce 
the time required to find an independent set to a constant by implementing only a single step of Luby's 
algorithm. The result will not be a maximal independent set, nevertheless we can use it to perform the 
corresponding resamplings in parallel, but the expected number of steps this version of the algorithm takes 
is yet to be analyzed. 

Finally, it remains an interesting open question whether it is possible to derandomize our algorithm 
even if the degrees of the dependency graph are unbounded. 
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